Adaptive Algorithms for Pitch-synchronous Speech Signal Segmentation
نویسنده
چکیده
The paper describes and compares two time domain algorithms for segmenting voiced speech into quasiperiodical units that correspond to pitch periods. The first algorithm uses the similarity of adjacent segments to build a graph that represents distances between them and finds the minimal path in it using a "greedy" algorithm. The second algorithm implements a set of heuristics that imitate actions of a human, who manually solves the problem. It starts from the middle of the voiced segment, finds the highest waveform peak, and then using an estimate of fundamental frequency period searches for peaks on the both sides. Some rules are applied to decide on the direction of processing, to prune errors at the beginning and at the end of the signal and to cope with jitter. The experimental results of the algorithms’ performance are presented. The algorithm’s application to precise pitch estimation is discussed.
منابع مشابه
Automatic Segmentation of Greek Speech Signals to Broad Phonemic Classes
In this paper, we evaluate an implicit approach for the automatic detection of broad phonemic class boundaries of continuous speech signals. The reported method is consisted of the prior segmentation of speech signal into pitch-synchronous segments, using pitchmark locations, for the computation of adjacent broad phonemic class boundaries. The approach’s validity was tested on a phonetically ri...
متن کاملAdaptive Segmentation with Optimal Window Length Scheme using Fractal Dimension and Wavelet Transform
In many signal processing applications, such as EEG analysis, the non-stationary signal is often required to be segmented into small epochs. This is accomplished by drawing the boundaries of signal at time instances where its statistical characteristics, such as amplitude and/or frequency, change. In the proposed method, the original signal is initially decomposed into signals with different fr...
متن کاملAn Adaptive Segmentation Method Using Fractal Dimension and Wavelet Transform
In analyzing a signal, especially a non-stationary signal, it is often necessary the desired signal to be segmented into small epochs. Segmentation can be performed by splitting the signal at time instances where signal amplitude or frequency change. In this paper, the signal is initially decomposed into signals with different frequency bands using wavelet transform. Then, fractal dimension of ...
متن کاملAn Adaptive Segmentation Method Using Fractal Dimension and Wavelet Transform
In analyzing a signal, especially a non-stationary signal, it is often necessary the desired signal to be segmented into small epochs. Segmentation can be performed by splitting the signal at time instances where signal amplitude or frequency change. In this paper, the signal is initially decomposed into signals with different frequency bands using wavelet transform. Then, fractal dimension of ...
متن کاملHybrid electroglottograph and speech signal based algorithm for pitch marking
Pitch marking is very significant in speech signal processing. In a text-to-speech (TTS) system based on the Time-Domain Pitch-Synchronous Overlap-Add (TD-PSOLA) method, robust estimation of pitch marks (PM) is especially important to the modification of the time and pitch scale of a speech signal in order to match it to that of the target speaker. The aim of this paper is to improve the accura...
متن کامل